Supervised enzyme network inference from the integration of genomic data and chemical information

Authors

  • Yoshihiro Yamanishi
  • Jean-Philippe Vert
  • Minoru Kanehisa
Abstract

MOTIVATION: The metabolic network is an important biological network that relates enzyme proteins and chemical compounds. Many metabolic pathways remain unknown, and many enzymes are missing even in known pathways. There is, therefore, an incentive to develop methods that reconstruct the unknown parts of the metabolic network and identify the genes coding for missing enzymes.

RESULTS: This paper presents new methods to infer enzyme networks from the integration of multiple genomic data and chemical information, in the framework of supervised graph inference. The originality of the methods is the introduction of chemical compatibility as a constraint for refining the network predicted by the network inference engine. The chemical compatibility between two enzymes is obtained automatically from the information encoded by their Enzyme Commission (EC) numbers. The proposed methods are tested and compared on their ability to infer the enzyme network of the yeast Saccharomyces cerevisiae from four datasets for enzymes with assigned EC numbers: gene expression data, protein localization data, phylogenetic profiles and chemical compatibility information. Prediction accuracy of the network reconstruction consistently improves with the introduction of chemical constraints, the use of a supervised approach and the weighted integration of multiple datasets. Finally, we conduct a comprehensive prediction of a global enzyme network consisting of all enzyme candidate proteins of the yeast to obtain new biological findings.

AVAILABILITY: Software is available upon request.
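The idea of refining a predicted network with a chemical constraint can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the `EC_REACTIONS` table, the placeholder EC labels, the gene names, the scores, the weights and the threshold are all hypothetical, and a plain thresholded weighted sum stands in for the supervised inference engine, which the abstract does not detail. Compatibility is modelled here as "a product of one enzyme is a substrate of the other", which is one plausible reading and not necessarily the authors' definition.

```python
# Hypothetical toy table: placeholder EC label -> (substrates, products).
# These entries are illustrative only, not real enzyme annotations.
EC_REACTIONS = {
    "EC:a": ({"A"}, {"B"}),
    "EC:b": ({"B"}, {"C"}),
    "EC:c": ({"X"}, {"Y"}),
}


def chemically_compatible(ec_a, ec_b, table=EC_REACTIONS):
    """Return True if a product of one enzyme is a substrate of the other."""
    sub_a, prod_a = table[ec_a]
    sub_b, prod_b = table[ec_b]
    return bool(prod_a & sub_b) or bool(prod_b & sub_a)


def integrate(similarities, weights):
    """Weighted sum of per-dataset similarity scores for each enzyme pair."""
    pairs = similarities[0].keys()
    return {p: sum(w * s[p] for s, w in zip(similarities, weights)) for p in pairs}


def refine(scores, enzyme_ec, threshold):
    """Keep edges that pass the score threshold and the chemical constraint."""
    return {
        (a, b)
        for (a, b), score in scores.items()
        if score >= threshold and chemically_compatible(enzyme_ec[a], enzyme_ec[b])
    }


# Hypothetical genes, EC assignments and per-dataset similarity scores.
enzyme_ec = {"g1": "EC:a", "g2": "EC:b", "g3": "EC:c"}
expression = {("g1", "g2"): 0.8, ("g1", "g3"): 0.9, ("g2", "g3"): 0.1}
phylogeny = {("g1", "g2"): 0.6, ("g1", "g3"): 0.7, ("g2", "g3"): 0.2}

scores = integrate([expression, phylogeny], weights=[0.5, 0.5])
edges = refine(scores, enzyme_ec, threshold=0.5)
print(edges)
```

In this toy example the pair (g1, g3) has the highest integrated score but is discarded because the two enzymes share no compound, which is the kind of false positive a chemical constraint is meant to filter out.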


Similar resources

Prediction of drug–target interaction networks from the integration of chemical and genomic spaces

MOTIVATION: The identification of interactions between drugs and target proteins is a key area in genomic drug discovery. Therefore, there is a strong incentive to develop new methods capable of detecting these potential drug-target interactions efficiently. RESULTS: In this article, we characterize four classes of drug-target interaction networks in humans involving enzymes, ion channels, G-pr...


Selective Integration of Multiple Genomic Data for Biological Network Inference

In computational biology, there has recently been a surge of interest in biological networks such as protein interaction networks, gene regulatory networks, and metabolic networks, which help us to understand the cellular machinery. Most biological networks represent the relationships between genes or proteins. Namely, the existence of edges means that the corresponding genes/prot...


Oil Extraction from Pistacia Khinjuk - Experimental and Prediction by Computational Intelligence Models

This study investigates oil extraction from Pistacia Khinjuk by the application of enzyme. Artificial Neural Network (ANN) and Adaptive Neuro Fuzzy Inference System (ANFIS) were applied for modeling and prediction of oil extraction yield. 16 data points were collected and the ANN was trained with one hidden layer using various numbers of neurons. A two-layered ANN provides the best results, us...


Simultaneous inference of biological networks of multiple species from genome-wide data and evolutionary information: a semi-supervised approach

MOTIVATION: The existing supervised methods for biological network inference work on each of the networks individually based only on intra-species information such as gene expression data. We believe that it will be more effective to use genomic data and cross-species evolutionary information from different species simultaneously, rather than to use the genomic data alone. RESULTS: We created a...


DINIES: drug–target interaction network inference engine based on supervised analysis

DINIES (drug-target interaction network inference engine based on supervised analysis) is a web server for predicting unknown drug-target interaction networks from various types of biological data (e.g. chemical structures, drug side effects, amino acid sequences and protein domains) in the framework of supervised network inference. The originality of DINIES lies in prediction with state-of-the...



Journal title:
  • Bioinformatics

Volume 21, Suppl 1

Publication date: 2005